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DESATURASE GENES AND THEIR USE 



The present invention relates, inter alia, to novel desaturases and to uses thereof. 

Over the last few years a number of microsomal and soluble fatty acid desaturases have 
been isolated from higher plants, most notably from Arabidopsis thaliana. This has 
resulted from a combined genetic and biochemical approach to the generation and 
complementation of mutant Arabidopsis lines defective in fatty acid desaturation or 
elongation (Somerville C, Browse J (1996) Trends Cell Biol 6, 148-1153). The importance 
of this approach has been validated by the isolation and characterisation of genes encoding 
microsomal desaturases such the A 12 (Okuley J, et al (1994), Plant Cell 6. 147-158) 
andA 15 (Arondel V, et aU (1992) Science 258, 1353-1355) desaturases (encoded by the 
FAD2 and FAD3 genes respectively), enzymes which had previously proved intractable to 
classical purification techniques on account of their hydrophobicity. The isolation of these 
and related enzymes, such as the A 12 hydroxylase from Ricinus communis (van de Loo FN 
et al (1995) Proc. Natl Acad. Sci USA 92, 6743-6747), has allowed the identification of a 
number of conserved motifs in plant microsomal desaturases, most notably the so-called 
"histidine boxes" (Shanklin, J et al (1997) Proc Natl Acad. Sci USA. 92, 6743-6747). 
Proteins containing these motifs can be classified as di-iron centre-containing enzymes 
(Shanklin, J etal (1997) Proc. Natl AcadSci. USA 94, 2981-1986). 



W093/ 11245 (Du Pont) discloses various nucleic acid fragments encoding desaturases, 
particularly A 12 and A 15 desaturases, which have been isolated from various plants. 
Recently a cDNA clone was isolated from the plant borage, (Borago officinalis) which 
accumulates y-linoleic acid (GLA), using highly degenerate PCR against these histidine 
motifs. US56 14393 (Rhone-Pouienc Agrochimie) discloses and claims the nucleotide 
sequence of borage A 6 desaturase. Whilst the specification suggests that A 6 
desaturase-encoding nucleic acids might be isolated from animal cells without difficulty by 
the skilled person no suitable animal cells arc suggested (in contrast to suggested fungal 
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and bacterial cells) and there is no disclosure of the isolation of such nucleic acids from 
animal cells. The isolated DNA clone was shown by heterologous expression in transgenic 
tobacco to encode a microsomal A 6 desaturase (Sayanova O et al (1997) Proc. Natl. Acad. 
Sci. USA. 94, 421 1-4216). Desaturation at the A 6 position is an unusual modification in 
higher plants, occurring only in a small number of species such as borage, evening 
primrose (Oenothera spp.) and redcurrant (Ribes spp.), which accumulate the A 6 - 
unsaturated fatty acids GLA and octadecatetraenoic acid (OTA:^: 4 * 69 - 1215 - also known as 
stearidonic acid) in the seeds and/or leaves. 

(\ GLA is a high value plant fatty acid, and is widely used in the treatment of a number of 
medical conditions, including eczema and mastalgia. It has been postulated that the 
application of GLA replaces the loss of, or meets an increased requirement for, endogenous 
A 6 -unsaturated fatty acids (Horrobin, D.F. (1990) Rev. Contemp. Pharmacother. 1: 1-45). 

For reference purposes Figure 5 is provided to show in simplified form a metabolic 
pathway believed to occur in certain organisms (including humans) and involving A 6 
desaturates. It can be seen that GLA can be synthesised in vivo from linoleic acid under the 
action of a A 6 desaturase and that GLA can be used to synthesise dihomo-GLA, which can 
be converted to arachidonic acid under the influence of a A 5 desaturase. Arachidonic acid 
is a precursor of various important eicosanoids (including prostaglandins and leucotrienes). 
A 6 desaturase also converts a linoleic acid into OTA. Thus it is clear that the A 6 desaturase 
is the first committed step on the biosynthetic pathway of these biologically active 
molecules (see Fig. 5). 

The sequence of the previously isolated borage microsomal A 6 desaturase differs from 
previously characterised plant microsomal desaturases/hydroxylases in that it contains an 
N-terminal extension which shows homology to cytochrome b s , and also in that the third 
(most C-terminal) histidine box varies from the consensus (Shanklin J et al (1997) Proc. 
Natl. Acad. Sci. USA 94, 2981-1986) H-X-X-H-H, with a glutamine replacing the first 
histidine. This was also observed in the case of the cyanobacteria Synechocystis A 6 
desaturase (GenBank ID; LI 1421). WO93/06712 (Rhone Poulenc Agrochimie) discloses 
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an isolated nucleic acid encoding a A 6 desaturase isolated from the Synechocystis, and 
claims bacterial A 6 desaturases and their uses. 

Although A 6 fatty acid desaturation is an unusual modification in higher plants, it is^ 
believed to be common in animals. The essential fatty acid linoleic acid (18:2 A 9 * 12 ) is 
desaturated to GLA by a A 6 desaturase as a first step in the biosynthetic pathway of the 
eicosanoids (which include prostagladins and leucotrienes). This results in the rapid j 

metabolism of GLA (to di-homo-GLA and arachidonic acid; i.e. 20:3 A 81114 and 20:4 f 

f 

A 5.8.n.H reS pectively). Accumulation of GLA is therefore not usually observed. 

The nematode worm Caenorhabitis elegans is extremely useful in that it has well 
understood genetics and has many similarities with higher animals such as humans and is 
therefore extremely useful in the development of desaturases for use in such animals. 

According to the present invention, there is provided a polypeptide having desaturase activity, 
whicKcomprises the amino acid sequence shown in Figure 1. 

The aminia acid sequence shown in Figure 1 is that of a A 6 desaturase that is present in the 
nematode wWi CaenorhabditisjUga^ is highly significant since prior to the present 
invention no s^ccess^l sequencing or purification of an animal A 6 desaturase had been 
reported. As C eh&ans does not accumulate GLA isolation of a A 6 desaturase from it was an 
unexpected target fori^olating desaturases gene in. 

The desaturase of the invention is significantly different from known desaturases. The 
homology between the A 6 desaturase of the invention and the microsomal A 12 and A 15 
desaturases from Arabidopsis described in W093/1 1245 are 24% and 16% respectively as 
determined using the BESTFTT program. The A 6 desaturase gene of the present invention 
shows 21% identity with the Celegans FAT- 1 desaturase described in Spychalla, J. P. et al 
Proc. Natl Acad. Sci 94 1 142-1 147 paper. The sequence homology between the A 6 desaturase 
of the present invention and the Synechodtocystis A 6 described in WO93/067 12 is only 23%. 

According to another aspect of the invention there is provided therefore an isolated animal A 6 
desaturase. 
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^he amino acid sequence shown in Figure 1 is also of significance because it has a very low 
leWl of sequence identity with the borage A 6 desaturase (the only other eukaryotic A 6 
desaturase to have been sequenced prior to the present invention). Indeed, this level of 
sequ^ce identity is below 32 %. At such a low level of identity it might be expected that the 
two poWpeptides would have completely different functions. Unexpectedly, both have A 6 
desaturasaactivity. 

The presentiinvention is, however, not limited to a A 6 desaturase having the sequence shown 
in Figure 1. It also includes other desaturases having at least 32% sequence identity therewith. 
Preferred polypeptides of the present invention have at least 40 % or more preferably at least 
50% amino Jbid sequence identity therewith. More preferably the degree of sequence identity 
is at least 75%. Sequence identities of at least 90%, at least 95% or at least 99% are most 
preferred. \ 

For the puiposes of the present invention, sequence identity (whether amino acid or 
nucleotide) can be determined by using the "BESTFTF* program of the Wisconsin Sequence 
Analysis Package GCG 8.0. 

Where high degrees of sequence identity are present there may be relatively few differences 
in amino acid sequence. Thus for example there may be less than 20, less than 10, or even 
less than 5 differences. 

Fragments of the polypeptides described above are also within the scope of the present 
invention, provided that they have desaturase activity, that is to say they have the ability to 
introduce a double bond into a substrate at a specific position as determined by GCMS. What 
is the lowest limit for activity. These fragments are preferably at least 100 amino acids long 
More preferably, the fragments are at least 150 amino acids long. 

In summary, ^polypeptide of the present invention has desaturase activity and: 

a) comprises fte amino acid sequence shown in Figure 1 ; 
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b) \ has one or more amino acid deletions, insertions or substitutions relative to a 
polypeptide as defined in a) above, but has at least 32% amino acid sequence identity 
therewith; or 

c) is a fragment of a polypeptide as defined in a) or b) above, which is at least 100 amino 
acids lo^jg. 

The term "polypeptide" is used herein in a broad sense to indicate that a particular molecule 
comprises a plurality of amino acids joined together by peptide bonds. It therefore includes 
within its scope substances, which may sometimes be referred to in the literature as 
peptides, polypeptides or proteins. 

ssirably a polypeptide of the present invention will have a cytochrome domain.. A 
cytochrome domain can be defined as an electron-transporting domain that contains a heme 
prosthetic group. Preferably a cytochrome b domain is present. More preferably a cytochrome 
b 5 doWin is present (desirably this includes a H-P-G-G-Xi 5 -F-X 3 -6-H, where X is any amino 
acid, iribtif). A cytochrome b 5 domain is present in both the borage A 6 desaturase and in the 
C. elegankA 6 desaturase amino acid sequence shown in Figure 2B The cytochrome b s domain 
is preferabl\an N-terminal domain - i.e. it is closer to the N-terminal end of the desaturase 
than to the cWminal end. This contrasts with other desaturases. For example, yeast A 9 
desaturase, has a^C-terminal cytochrome b 5 domain and plant A 12 and A 15 desasturases which 
do not have any bs qpmain. 

A polypeptide of the present invention preferably has one or more (most preferably three) 
histidine boxes. One of these may have an H->Q substitution. (This provides a variant 
histidine box that is believed to be conserved over a range of animal / plant species.) 

Polypeptides of the present invention can have any regiospecificity including cisltrans 
activity although it is preferred that they are front end desaturases that introduce a double 
bond between the C3 and C7 positions, measured from the COOH (A end) of the group. A 
skilled person is readily able to distinguish between different desaturases by determining the 
different positions of double bonds introduced by the desaturases. This can be done by known 
analytical techniques e.g. by using gas chromatography and mass spectrometry. 
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Particularly preferred desaturases of the invention are A 6 desaturases. 

Desirably the desaturases occur naturally in one or more organisms that do riot accumulate 
GLA (i.e. where GLA may be produced, but is not normally detectable because it is very 
quickly metabolised). Such desaturases may occur naturally in one or more animals. The 
desaturases occur naturally in one or more nematodes, e.g. in C. eleggns. 

In order to appreciate the scope of the present invention more fully, polypeptides within the 
scope of each of a), b) and c) above will now be discussed in greater detail. 

Polypeptides within the scope of a) 

A polypeptide within the scope of a) may consist of the amino acid sequence shown in Figure 
1 or may have an additional N-terminal and/or an additional C-terminal amino acid sequence. 

Additional N-terminal or C-terminal sequences may be provided for various reasons and 
techniques for providing such additional sequences are well known in the art. Such 
techniques include using gene-cloning techniques whereby nucleic acid molecules are ligated 
together and are then used to express a polypeptide in an appropriate host. 

Additional sequences may be provided in order to alter the characteristics of a particular 
polypeptide. This can be useful in improving expression or regulation of expression in 
particular expression systems. For example, an additional sequence may provide some 
protection against proteolytic cleavage. 

Additional sequences can also be useful in altering the properties of a polypeptide to aid in 
identification or purification. For example, a signal sequence may be present to direct the 
transport of the polypeptide to a particular location within a cell or to export the polypeptide 
from the cell. Different signal sequences can be used for different expression systems. 

Another example of the provision of an additional sequence is where a polypeptide is linked 
to a moiety^pable-of-being-isolated-ty^ moiet y may be an 

epitope and the affinity column may comprise immobilised antibodies or immobilised 
antibody fragments that bind to said epitope (desirably with a high degree of specificity). The 
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resultant fusion protein can usually be eluted from the column by addition of an appropriate 
buffer. 

Additional N-terminal or C-terminal sequences may, however, be present simply as a result of 
a particular technique used to obtain a polypeptide of the present invention and need not 
provide any particular advantageous characteristic. 

Polypeptides within the scope ofb) 

Turning now to the polypeptides defined in b) above, it will be appreciated that these are 
variants of the polypeptides given in a) above. 

Various changes can often be made to the amino acid sequence of a polypeptide which has a 
desired property in order to produce variants which still have that property. Such variants of 
the polypeptides described in a) above are within the scope of the present invention and are 
discussed in greater detail in sections (i) to (Hi) below. They include allelic and non-allelic 
variants. 

(i) Substitutions 

An example of a variant of the present invention is a polypeptide as defined in a) above, apart 
from the substitution of one or more amino acids with one or more other amino acids. 

The skilled person is aware that various amino acids have similar characteristics. One or 
more such amino acids of a polypeptide can often be substituted by one or more other such 
amino acids without eliminating a desired property of that polypeptide (such as desaturase 
activity). 

For example, the amino acids glycine, alanine, valine, leucine and isoleucine can often be 
substituted for one another (amino acids having aliphatic side chains). Of these possible 
substitutions it is preferred that glycine and alanine are used to substitute for one another 
(since they have relatively short side chains) and that valine, leucine and isoleucine are used 
to substitute for one another (since they have larger aliphatic side chains which are 
hydrophobic). Other amino acids that can often be substituted for one another include 
phenylalanine, tyrosine and tryptophan (amino acids having aromatic side chains); lysine, 
arginine and histidine (amino acids having basic side chains); aspartate and glutamate (amino 
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acids having acidic side chains); asparagine and glutamine (amino acids having amide side 
chains); and cysteine and methionine (amino acids having sulphur containing side chains). 

Substitutions of this nature are often referred to as "conservative" or "semi-conservative" 
amino acid substitutions. 

(ii) Deletions 

Amino acid deletions can be advantageous since the overall length and the molecular weight 
of a polypeptide can be reduced whilst still retaining a desired activity. This can enable the 
amount of polypeptide required for a particular purpose to be reduced. 

(Hi) Insertions 

Amino acid insertions relative to a polypeptide as defined in a) above can also be made. This 
may be done to alter the nature of the polypeptide (e.g. to assist in identification, purification 
or expression). 

Polypeptides incorporating amino acid changes (whether substitutions, deletions or 
insertions) relative to the sequence of a polypeptide as defined in a) above can be provided 
using any suitable techniques. For example, a nucleic acid sequence incorporating a desired 
sequence change can be provided by site-directed mutagenesis. This can then be used to 
allow the expression of a polypeptide having a corresponding change in its amino acid 
sequence. 

Polypeptides within the scope ofc) 

.discussed supra, it is often advantageous to reduce the length of a polypeptide. Feature c) 
of thWesent invention therefore covers fragments of the polypeptides a) or b) above which 
are at le^st 100 amino acids long, but which do not need to be as long as the full length 
polypeptide\shown in Figure 1. Desirably these fragments are at least 200, at least 300 or at 
least 400 arruiK\acids long. 

Various uses of the polypeptides of the present invention will now be described by way of 
example only. 
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Polypeptides of the present invention may be used, inter alia, in obtaining useful molecules. 
For example A 6 desaturases can be used in obtaining y-linolenic acid (GLA) or in obtaining 
metabolites in respect of which GLA is a precursor. For example, octadecatetraenoic acid 
(OTA; 18:4A 6A,2IS ), a member of the n-3 (or o>-3) fatty acids may be produced by the 
A 6 -desaturation of a-linolenic acid. 

GLA, OTA and their metabolites are useful in medicine. They can be used in the preparation 
of a medicament for treating a disorder involving a deficiency in GLA or of a metabolite 
derived in vivo from GLA (e.g. an eicosanoid). Disorders which may be treated include 
eczema, mastalgia, hypercholesterolemia, atherosclerosis, coronary disease, diabetic 
neuropathy, viral infections, acne, hypertension, cirrhosis and cancer. 

The metabolites may be produced in vivo in suitable hosts or in vitro. 

When a metabolite is to be produced in vitro, a desaturase of the present invention and its 
substrate will normally be provided separately and then combined when it is desired to 
produce the metabolite. The present invention therefore includes within its scope a method of 
making GLA or OTA comprising using a A 6 desaturase of the present invention to convert 
linoleic acid substrate or a-linolenic acid substrate to GLA or OTA respectively. 

When a metabolite is to be produced in vivo in a organism such as a plant or animal, the 
substrate for a desaturase of the present invention will normally be provided by the relevant 
non-human organism itself. In vivo production of the metabolite can therefore be achieved by 
inserting a gene encoding a desaturase of the present invention into the organism and 
allowing the organism to express the desaturase. The desaturase can then act on its substrate. 
It will therefore be appreciated that polypeptides of the present invention can be used to 
provide desaturase activity in organisms that would normally not possess such activity or to 
increase the level of desaturase activity in organisms already having some desaturase activity. 
If desired, a useful metabolite may be purified from such an organism. Alternatively the 
organism itself may be used directly as a source of the metabolite. Particular cloning 
techniques that can be used to provide transgenic organisms with desaturase activity are 
discussed later on. 
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Polypeptides of the present invention can also be used as indicators of the transformation of 
an organism. For example, if an organism intended to be transformed does not have a 
particular desaturase and a nucleic acid intended for use in transformation encodes that 
desaturase, an assay can be performed after attempted transformation to determine whether or 
not the desaturase is present. Thus, in the case of the A 6 desaturase, an assay for the presence 
of GLA may be performed and GLA can serve as a simple marker for the presence of a 
functional transgene cassette comprising a A 6 desaturase encoding sequence. 

A further use of the present invention is in providing antibodies. The present invention 
includes within its scope antibodies that bind to polypeptides of the present invention. 

Preferred antibodies bind specifically to polypeptides of the present invention and can 
therefore be used to purify such polypeptides. (For example, they may be immobilised and 
used to bind to polypeptides of the present invention. The polypeptides may then be eluted 
by washing with a suitable eiuent under appropriate conditions.) 

An antibody or a derivative thereof within the scope of the present invention may be used in 
diagnosis. For example binding assays using such an antibody or a derivative can be used to 
determine whether or not a particular desaturase is present. This is useful in diagnosing 
disorders that arise due to the absence of the functional desaturase. 

Antibodies within the scope of the present invention may be monoclonal or polyclonal. 

Polyclonal antibodies can be raised by stimulating their production in a suitable animal host 
(e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a polypeptide of the 
present invention is injected into the animal. If necessary an adjuvant may be administered 
together with a polypeptide of the present invention. The antibodies can then be purified by 
virtue of their binding to a polypeptide of the present invention. 

Monoclonal antibodies can be produced from hybridomas. These can be formed by fusing 
myeloma cells and spleen cells which produce the desired antibody in order to form an 
immortal cell line. Thus the well-known Kohler & Milstein technique (Nature 256 52-55 
(1975)) or variations upon this technique can be used. 
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Techniques for producing monoclonal and polyclonal antibodies that bind to a particular 
polypeptide are now well developed in the art. They are discussed in standard immunology 
textbooks, for example in Roitt et al. Immunology second edition (1989), Churchill 
Livingstone, London. 

In addition to whole antibodies, the present invention includes derivatives thereof which are 
capable of binding to polypeptides of the present invention. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments and 
synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 1994). 

s 

Antibody fragments include, for example, Fab, F(ab')2 and Fv fragments. (These are 
discussed, for example, in Roitt et al (supra)). Fv fragments can be modified to produce a 
synthetic construct known as a single chain Fv (scFv) molecule. This includes a peptide 
linker covalently joining V h and V, regions, which contributes to the stability of the molecule. 
Other synthetic constructs that can be used include CDR peptides. These are synthetic 
peptides comprising antigen-binding determinants. Peptide mimetics may also be used. 
These molecules are usually conformationally restricted organic rings that mimic the structure 
of a CDR loop and that include antigen-interactive side chains. 

Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or 
primatised) antibodies or derivatives thereof are within the scope of the present invention. An 
example of a humanised antibody is an antibody having human framework regions, but 
rodent hypervariable regions. 

Synthetic constructs also include molecules comprising an additional moiety which provides 
the molecule with some desirable property in addition to antigen binding. For example the 
moiety may be a label (e.g. a fluorescent or radioactive label). Alternatively, it may be a 
pharmaceutically active agent. 

The present invention also includes nucleic acid molecules within its scope. 
Such nucleic acid molecules: 



code for a polypeptide according to the present invention; or 
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b) are complementary to molecules as defined in a) above; or 

c) hybridise to molecules as defined in a) or b) above. 

These nucleic acid molecules and their uses are discussed in greater detail below: 

The polypeptides of the present invention can be coded for by a large variety of nucleic acid 
molecules, taking into account the well-known degeneracy of the genetic code. All of these 
coding nucleic acid molecules are within the scope of the present invention. Preferred coding 
nucleic acid molecules encode the polypeptide shown in Figure 1. These include nucleic acid 
molecules comprising the coding sequence shown in Figure 1 and degenerate variants 
thereof. 

The nucleic acid molecules may be used directly. Alternatively they may be inserted into 
vectors. 

Nucleic acids or vectors containing them may be used in cloning. They may be introduced 
into non-human hosts to enable the expression of polypeptides of the present invention using 
techniques known to those skilled in the art. Alternatively, cell free expression systems may 
be used. 

Techniques for cloning, expressing and purifying polypeptides are well known to the skilled 
person. Various such techniques are disclosed in standard text-books, such as in Sambrook et 
al (Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989)); in Old & 
Primrose (Principles of Gene Manipulation, 5th Edition, Blackwell Scientific Publications 
(1994)); and in Stryer (Biochemistry, 4th Edition, W H Freeman and Company (1995)). 

By using an appropriate expression system the polypeptides can be produced in a desired 
form. For example, the polypeptides can be produced by micro-organisms such as bacteria 
or yeast, by cultured insect cells (which may be baculovirus-infected), or by mammalian 
cells (such as CHO cells). 

However preferred hosts arc plants or plant propagating material e.g. oil seed rape, 
sunflower, cereals including maize, tobacco, legumes including peanut and soybean, 



WO 99/27111 




PCT/GB98/03507 



safflower, oil palm, coconut and other palms, cotton, sesame, mustard, linseed, castor, 
borage and evening primrose, or propagating material therefor. 

The technology for providing plants or plant propagating material is now well developed. It is 
briefly discussed in WO 96/21022, for example. Desaturases isolated from animals have 
successfully been expressed in plants. For example, Spychalla, J.P. et al f (supra) describe 
the expression of a C. elegans desaturase in transgenic Arabidopsis. Additionally, 
EP0550162 (Pioneer Hi-Bred International, Inc) discloses a chimaric gene construct 
encoding a A 9 desaturase isolated from rat, and plants transformed with the construct for 
the production of fatty acids. The desaturase described in that publication has only 22% 
identity with the A 6 desaturase of the present invention. 

Particular techniques that can be used are discussed below. It will of course be appreciated 
that such techniques are non-limiting. 

(i) Vector systems based on Agrobacterium tumefaciens. 

These include Ti based systems, such as pGV3850, in which the T-DNA has been disarmed. 
Desirably a selectable marker is present (e.g. a marker that provides resistance to an 
antibiotic). 

Intennediate vectors (IVs) may also be used. They tend to be small in size and are therefore 
usually easier to manipulate than large Ti based vectors. IVs are generally vectors resulting 
from T-DNA having been cloned into E. coli derived plasmid vectors, such as pBR322. IVs 
are often conjugation-deficient and therefore a conjugation-proficient plasmid (such as 
pRK2013) may be used to mobilise an IV so that it can be transferred to an Agrobacterium 
recipient. In vivo homologous recombination can then occur in an Agrobacterium to allow an 
IV to be inserted into a resident, disarmed Ti plasmid in order that a cointegrate can be 
produced that is capable of replicating autonomously in the Agrobacterium. 

Another alternative is to use binary Ti vectors. Here a modified T-DNA region carrying 
foreign DNA can be provided on a small plasmid that replicates in £. coli (e.g. pRK252). 
This plasmid (sometimes called mini-Ti or micro-Ti) can then be transferred conjugatively 
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via a tri-parental mating into an A. tumefaciens that contains a compatible vir gene 
(providing the vir function in trans). 

Binary vectors without Ti sequences may even be used. Here bacterial mob and oriT 
functions may be used to promote plasmid transfer. Again, the vir function may be provided 
in trans . 

The vector systems discussed above can be used to transfer genes into plants by using the 
protocol of Horsch et al. (Science 227, 1229-31 (1985)) or variants thereof. Here small discs 
can be punched from the leaves of a dicotyledenous plant, they can be surface-sterilised, and 
can then be placed in a medium including A. tumefaciens that contains recombinant T-DNA 
in which a foreign gene to be transferred is accompanied by a selectable marker (e.g. the neo 
gene). The discs can then be cultured for 2 days and then transferred to a medium for 
selecting the selectable marker. (This can be done for a neo selectable marker by culturing 
using a medium containing kanamycin). A. tumefaciens can be killed by using a carbenicillin 
containing medium. Shoots will normally develop from a callus after 2-4 weeks. They can 
then be excised and transplanted to root-inducing medium and, when large enough can be 
transplanted into soil. 

(ii) Vector systems based on Agrobacterium rhizogenes 

These include Ri derived plasmids. Ri T-DNA is generally considered not to be deleterious 
and therefore such plasmids can be considered as equivalent to disarmed Ti plasmids. An IV 
co-integrate system based on Ri plasmids has been developed. 

(Hi) Plant protoplast based transformation systems 

Suitable techniques are described in "Plant Gene Transfer and Expression Protocols" ed. H. 
Jones, Human Press Methods in Molecular Biology, 49, 1995. 

Transformation of plants can be facilitated by removing plant cell walls to provide 
protoplasts. The cell walls can be removed by any suitable means, including mechanical 
disruption or treatment with cellulolytic and pectinolytic enzymes. Protoplasts can then be 
separated from other components by centrifugation and techniques such as electroporation 
can then be used to transform the protoplasts with heterologous DNA. Under appropriate 



WO 99/27111 




PCT/GB98/03507 



culture conditions the transformed protoplasts will grow new cell walls and also divide. 
Shoots and roots can then be induced and plantlets formed. 

(iv) Transfection by biolistics 

High velocity microprojectiles carrying DNA or RNA can be used to deliver that DNA or 
RNA into plant cells. This has allowed a wide variety of transgenic plants to be produced and 
is suitable for both monocotyledenous and dicotyledenous plants. For example gold or 
tungsten particles coated with DNA or RNA can be used. Suitable devices for propelling the 
microprojectiles include gunpowder based devices, electric discharge based devices and 
pneumatic devices. 

(v) Virus based systems 

DNA plant virus vectors include cauliflower mosaic viruses (which infect a range of dicots.) 
and geminiviruses ( which infect a wide range of dicots. and monocots). RNA plant viruses 
are in the majority and include Brome Mosaic Virus (which infects a number of Graminae, 
including barley) and Tobacco Mosaic Virus (which infects tobacco plants). 

From the foregoing description it will be appreciated that nucleic acid molecules encoding 
polypeptides of the present invention can be cloned and expressed in a wide variety of 
organisms. 

In addition to nucleic acid molecules coding for polypeptides of the present invention 
(referred to herein as "coding" nucleic acid molecules), the present invention also includes 
nucleic acid molecules complementary thereto. Thus, for example, both strands of a double 
stranded nucleic acid molecule are included within the scope of the present invention 
(whether or not they are associated with one another). Also included are mRNA molecules 
and complementary DNA molecules (e.g. cDNA molecules). 

Nucleic acid molecules that can hybridise to one or more of the nucleic acid molecules 
discussed above are also covered by the present invention. Such nucleic acid molecules are 
referred to herein as "hybridising" nucleic acid molecules. 
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A hybridising nucleic acid molecule of the present invention may have a high degree of 
sequence identity along its length with a nucleic acid molecule within the scope of a) or b) 
above (e.g. at least 50%, at least 75% or at least 90% sequence identity). 

As will be appreciated by those skilled in the art, the greater the degree of sequence identity 
that a given single stranded nucleic acid molecule has with another single stranded nucleic 
acid molecule, the greater the likelihood that it will hybridise to a single stranded nucleic acid 
molecule which is complementary to that other single stranded nucleic acid molecule under 
appropriate conditions. 

Desirably hybridising molecules of the present invention are at least 10 nucleotides in length 
and preferably are at least 25, at least 50, at least 100 or at least 200 nucleotides in length. 

Preferred hybridising molecules hybridise under stringent hybridisation conditions. One 
example of stringent hybridisation conditions is where attempted hybridisation is carried out 
at a temperature of from about 35°C to about 65°C using a salt solution that is about 0.9 
molar. However, the skilled person will be able to vary such parameters as appropriate in 
order to take into account variables such as probe length, base composition, type of ions 
present, etc. 

Most preferably, hybridising nucleic acid molecules of the present invention hybridise to a 
DNA molecule having the coding sequence shown in Figure 1 to an RN A equivalent thereof, 
or to a complementary sequence to either of the aforesaid molecules. 

Hybridising nucleic acid molecules can be useful as probes or primers, for example. 

Probes can be used to purify and/or to identify nucleic acids. For example they can be used to 
identify the presence of all or part of a desaturase gene and are therefore useful in diagnosis. 

Primers are useful in amplifying nucleic acids or parts thereof, e.g. by PCR techniques. 

In addition to being used as probes or primers, hybridising nucleic acid molecules of the 
present invention can be used as antisense molecules to alter the expression of polypeptides 
of the present invention by binding to complementary nucleic acid molecules. (Generally this 
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can be achieved by providing nucleic acid molecules that bind to RNA molecules that would 
normally be translated, thereby preventing translation due to the formation of duplexes.) 

Hybridising molecules may also be provided as ribozymes. Ribozymes can also be used to 
regulate expression by binding to and cleaving RNA molecules that include particular target 
sequences recognised by the ribozymes. 

From the foregoing discussion it will be appreciated that a large number of nucleic acids are 
within the scope of the present invention. Unless the context indicates otherwise, nucleic acid 
molecules of the present invention may therefore have one or more of the following 
characteristics: 

1) They may be DNA or RNA (including variants of naturally occurring DNA or RNA 
structures, which have non-naturally occurring bases and/or non-naturally occurring 
backbones). 

2) They may be single or double stranded. 

3) They may be provided in recombinant form i.e. covalently linked to a heterologous 5' 
and/or 3' flanking sequence to provide a chimaeric molecule (e.g. a vector) which 
does not occur in nature. 

4) They may be provided without 5' and/or 3' flanking sequences that normally occur in 
nature. 

5) They may be provided in substantially pure form, e.g. by using probes to isolate 
cloned molecules having a desired target sequence or by using chemical synthesis 
techniques. Thus they may be provided in a form which is substantially free from 
contaminating proteins and/or from other nucleic acids. 



They may be provided with introns (e.g. as a full-length gene) or without introns (e.g. 
as cDNA). 
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The present invention will now be described by way of example only, with reference to the 
accompanying drawings, Figures 1 to 6 wherein: 

Fig 1 shows the DNA sequence and the deduced amino acid sequence of the full length - 
C. elegans cDNA pCeD6.1. The positions of the N-terminal cytochrome b 5 domain and the 
variant third histidine box are underlined. The deduced amino acid sequence of this cDNA 
is identical to that predicted for residues 1-38 and 68-473 of W08D2.4. 

Fig 2A shows a comparison of the deduced amino acid sequences of the C. elegans cDNA 
CeD6.1 and the C. elegans predicted protein W08D2.4. (MywormD6=CeD6.1; 
cew08d2=ORF W08D2.4.) 

Fig 2B shows a comparison of the deduced amino acid sequences of the borage A 6 
desaturase (Sayanova O et al (1997) Proc. Natl Acad. Sri. USA 94, 421 1-4216) and the C. 
elegans cDNA CeD6.1. ( Koof&6=Borage officianalis A 6 desaturase; ceeld6=CeD6.1.) 

Fig 3 shows methyl esters of total lipids of S. cerevisiae grown under inducing conditions 
(linololate and galactose). 

Fig 4 shows GC-MS analysis of the novel peak identified in yeast carrying pYCeD6.1. 

Fig 5 shows a simplified version of the metabolism of n-6 essential fatty acids in mammals. 

Fig 6 shows fatty acid and methyl esthers of leaf material from either control transformed 
Arabidopsis plant (A) or transformed Arabidopsis plant expressing the C. elegansA 6 
desaturase (B). 

Example 1 - Isolation of A* Desaturase Gene and Expression in Yeast 

The NCBI EST sequence database was searched for amino acid sequences using a known 
borage A 6 fatty acid desaturase (Sayanova O et al (1997) supra) and limiting the search to 
sequences containing a variant histidine box Q-X-X-H-H. 
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C elegans ESTs were identified. They were further characterised by searching the 
C elegans EST project database (Prof. Y. Kohara lab (National Institute of Genetics, 
Mishima, Japan); DNA Database of Japan) to identify related cosmid clones. 

A partial 448 base pair cDNA clone designated as yk436bl2 identified by these searches 
was obtained from the C. elegans EST project, and this was used to screen a C. elegans 
cDNA library (mixed stage; also supplied by Prof Kohara) This indicated that the clone 
yk436bl2 was homologous to part of a gene present on cosmid W08D2 (Genbank 
accession number Z70271), which forms part of chromosome IV. Bases 21-2957 of cosmid 
W0D2 are predicted by the protein prediction program Genefinder (Wilson R et al (1994) 
Nature 368 32-38 to encode an ORF of 473 residues which is interrupted by 5 introns. 
Wilson, R. et al disclose part of the sequence of chromosome HI of C. elegans. A number 
of positives were identified and further purified, and full length clones were confirmed by 
sequencing to encode a transcript likely to have been transcribed from the gene designated 
W08D2.4, on cosmid W08D2, as determined by database searching of the genes sequenced 
by the C. elegans genome project 

Examination of this predicted polypeptide (designated W08D2.4 by the Sanger Centre 
Nematode Sequencing Project, Hinxton, UK) revealed that it had a number of 
characteristics reminiscent of a microsomal fatty acid desaturase, including three histidine 
boxes. However, the predicted protein sequence indicated the presence of an N-tenninal 
domain similar to cytochrome b 5 , containing the diagnostic H-P-G-G motif found in 
cytochrome b 5 proteins (LedererF (1994) Biochimie. 76, 674-692). Since the A 6 desaturase 
isolated by us from borage also contained an N-terminal b 5 domain, this indicated that 
W0D2.4 may encode a D 6 desaturase. 

Closer examination of the sequence revealed the presence of the variant third histidine box, 
with an H-»Q substitution (again as observed in the borage A 6 desaturase). The degree of 
similarity between W08D2.4 and the borage A 6 desaturase is <52% and is therefore low. 
The figure of <3 1% obtained for identity is also low. 
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Since W08D2.4 was encoded by a gene containing many (6) introns, it was necessary to 
isolate a full length cDNA to verify the sequence predicted by the Genefinder program , 
and to also allow the expression of the ORF to define the encoded function. 

A cDNA library was screened with the EST insert yk436bl2 (generously provided by 
Prof Y. Kohara) and a number of positive plaques were identified. These were further 
purified to homogeneity, excised, and the largest inserts (of -1450 bp) from the resulting 
rescued phagemids were sequenced. This confirmed that the cDNAs isolated by us were 
indeed homologous to W08D2.4, with the 5' and 3' ends of the cDNA being equivalent to 
bases 9 and 3079 of the sequence of cosmid W08D2. Since the ATG initiating codon 
predicted by the Genefinder program to be the start of gene product W08D2.4 was indeed 
the first methionine in the cDNA clone, we reasoned that we had isolated a bona fide full 
length cDNA. The DNA sequence and deduced amino acid sequence of one representative 
cDNA clone (termed pCeD6.1; 1463 bp in length) is shown in Fig 1; the deduced amino 
acid sequence is identical to that predicted for W08D2.4 over the majority of the protein. 
The positions of the N-terminal cytochrome b 5 domain and the variant third histidine box 
are underlined. The deduced amino acid sequence of this cDNA is identical to that 
predicted for residues 1 -38 and 68-473 of W08D2.4. 

Vowever, DNA sequences encoding residues 38-67 (Y-S-I L-Y-F) predicted for 

W08D2.4 are not present in the cDNA clone. This means that the deduced amino acid 
sequence of CeD6.1 is in fact 443 amino acids long, as opposed to that predicted for 
W08DG.4, which is 473 residues in length. The only other difference between the two 
amino add sequences is an M-»V substitution at residue 401 , resulting from an A->G base 
change (tffcse 1211). The two sequences are compared in Fig 2A, as is the deduced amino 
acid sequence of the borage A 6 desaturase and that of CeD6.1 (Fig 2B). The extra sequence 
predicted for ^0802.4 is likely to derived from incorrect prediction of intron-exon 
borders. \ 



Note the presence ortfie H-P-G-G cytochrome b 5 motif in the N-terminus (encoded by 
bases 96-108) and thell Q substitution in the third histidine box (encoded by bases 
1157-1172). \ 
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The coding sequence of W08D2.4 was introduced into the yeast expression vector pYES2 
by PCR. Oligonucleotides with 5' overhangs were used to introduce Kpnl and Sad sites at 
the 5* and 3* ends respectively. The fidelity of the construct was checked by in vitro 
transcription and translation using the TnT system (Promega). 

Specifically, clone pCeD6.1 was then used as a template for PCR amplification of the 
entire predicted coding sequence (443 amino acid residues in length), and cloned into the 
yeast expression vector pYES2 (Invitrogen) to yield pYCeD6. The fidelity of this 
PCR-generated sequence was checked in vitro transcription/translation of the plasmid, 
using the T7 RNA polymerase promoter present in pYES2. 

Using the Promega TnT coupled transcription/translation system,translation products were 
generated and analysed by SDS-PAGE and autoradiography as per the manufacturer's 
instructions. This revealed (data not shown) that the plasmid pYCeD6 generated a product 
of ~55kD, whereas the control (pYES2) failed to yield any protein products, indicating that 
the construct was correct. 

The resulting plasmid was introduced into yeast (5. cerevisiae) by the lithium acetate 
method (Guthrie C, Fink GR (1991) Meths Enz 194) and expression of the transgene was 
induced by the addition of galactose. The yeast was supplemented by addition of 0.2 mM 
linoleate (sodium salt) in the presence of 1% tergitol NP-40. 

Transformation and selection of yeast able to grow on uracil-deficient medium revealed 
yeast colonies carrying the recombinant plasmid pYCeD6 by virtue of the URA3 selectable 
marker carried by pYES2. Expression of pYCeD6 was obtained by inducing the GAL 
promoter that is present in pYES2. This was carried out after the cells had been grown up 
overnight with raffinose as a carbon source, and the medium supplemented by the addition 
of linoleate (18:2) in the presence of low levels of detergent. This later addition was 
required since the normal substrate for A 6 desaturation is 18:2 fatty acids, which do not 
normally occur in 5. cerevisiae. 



Yeast total fatty acids were analyzed by GC of methyl esters . Confirmation of the presence 
of GLA was carried out by GC-MS (Sayanova et al (1997) supra). 
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In more detail, the cultures were then allowed to continue to grow after induction, with 
aliquots being removed for analysis by GC. When methyl esters of total fatty acids isolated 
from yeast carrying the plasmid pYCeD6 and grown in the presence of galactose and 
linoleate were analyzed by GC, an additional peak was observed (Fig 3). In Fig. 3 Panel A 
is yeast transformed with control (empty) vector pYES2, panel B is transformed with 
pYCeD6.1. The common fatty acid-methyl esters were identified as 16:0 (peak 1), 16:1 
(peak 2), 18:0 (peak 3), 18:1 (peak 4), 18:2 (peak 5; supplied exogenously). The additional 
peak (6) in panel B corresponds to 18:3 GLA, and is indicated by an arrowhead. This had 
the same retention time as an authentic GLA standard, indicating that the transgenic yeast 
were capable of A 6 -desaturating linoleic acid. No such peaks were observed in any of the 
control samples (transformation with pYES2). The identity of this extra peak was 
confirmed by GC-MS, which positively identified the compound as GLA (Fig 4). In the 
Figure 4 experiment, the sample was analyzed for mass spectra as before (Sayanova O et al 
(1997) Proc. Natl Acad. ScL USA 94, 421 1-4216), and the data used to search a library of 
profiles. The sample was identified as GLA. A comparison of the mass spectra of the novel 
peak (A) and authentic GLA (B) is shown; visual and computer-based inspection revealed 
them to be identical. This confirms that CeD6.1 encodes a C. elegans A 6 desaturase, and 
that this cDNA is likely to be transcribed from the gene predicted to encode ORF 
W08D2.4, though the deduced amino acid sequence of CeD6.1 is 30 residues smaller than 
thatofW08D2.4 

Example 2 -Expression of C.elegans A 6 desaturase in plants 

The coding sequence of the C. elegans A 6 desaturates was subcloned into a plant 
expression vector pJD330, which comprises a viral 35S promoter, and a Nos terminator. 
The resulting cassette or promoter/coding sequence/terminator was then subcloned into the 
plant binary transformation vector pBin 19, and the resulting plasmid was introduced into 
Agrobacterium tumefaciens. This Agrobacterium strain was then used to transform 
Arabidopsis by the vacuum-infiltration of inflorescences. Seeds were harvested and plated 
onto selective media containing kariamycin. Since pBin 19 confers resistance to this 
antibiotic, only transformed plant material will grow. Resistant lines were identified and 
self-fertilized to produce homozygous material. Leaf material was analyzed for fatty acid 
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profiles using the same method as used for the expression of the nematode desaturase in 
yeast. Fatty acid methyl esthers were separated by GC, and novel peaks shown in Figure 6 
identified by comparison with known standards and GCMS. Two novel peaks can be seen 
in (B) which were identified asy -linolenic acid (peak 1) and octadecatetraenoic acid (peak 
2). These are the products of A 6 desaturation of the precursor fatty acids linoleic acid and a 
-linolenic acid, respectively. 

The inventors have shown that a C. elegans cDNA (CeD6.1) encodes a A 6 desaturase, and 
that this sequence is identical with the predicted ORF W08D2.4, except for a 30 residue 
insertion present in the N-terminal region of the latter protein. Whether the deduced amino 
acid sequence predicted for CeD6. 1 represents a splicing variant of W08D2.4, or is a result 
of a mis-prediction of the intron/exon junctions by the Genefinder programme is unclear. 
However it is clear that CeD6. 1 encodes a A 6 desaturase. 

The ORF encoded by the this C. elegans sequence appears to be related to the higher plant 
A 6 fatty acid desaturase previously isolated by us (Sayanova O et al (1997) supra), in that 
they both contain N-terminal domains which show homology to cytochrome b 5 . 
Microsomal fatty acid desaturases have been demonstrated to use free microsomal 
cytochrome b 5 as their electron donor (Smith MA, et al (1990) Biochem. J. 272, 23-29, 
Smith MA et al (1992) Biochem. J. 287, 141-144)), and the vast majority of identified 
sequences for these enzymes appear not to contain this additional cytochrome b$ domain 
(Okuley J et al (1994) Plant Cell 6, 147-158, Aronel V. et al (1992) Science 258, 
1 353- 1 355 and Napier, J.A. et al ( 1997) Biochemical J, 328:7 1 7-8). 

Prior to the present invention only two examples of cytochrome bs-domain-containing 
desaturases were known, one being the borage A 6 desaturase, and the other being the yeast 
microsomal A 9 (OLE1) desaturase (Napier J A et al (1997) Biochemical J, supra and 
Mitchell AG, Martin CE (1995) J. Biol Chem 270, 29766-29772). OLE1, however, 
contains a C-terminal cytochrome b 5 domain (Napier J A et al (1997) Biochemical /, in 
press and Mitchell AG, Martin CE (1995) J. Biol Chem. 270, 29766-29772). The reason 
for the cytochrome b 5 may be that the A 6 desaturase is a "front-end" desaturase. (A 
"front-end" desaturation can defined as the final desaturation reaction on the fatty acid 
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chain, usually introducing double bonds between a pre-existing bond and the A-end of the 
carboxy group (Mitchell AG, Martin CE (1995) 7. Biol Chem 270, 29766-29772 and 
Aitzetmuller K, Tseegsuren, N ( 1 994) J. Plant Physiol 143, 538-543).) 

In any event, it is now believed to be the case that both a variant histidine box and an 
N-terminal cytochrome b 5 domain are conserved in both animals and plants, as evidenced 
by their presence in both the borage and nematode A 6 desaturases. 

The invention may therefore allow the identification of other A 6 desaturases and also other 
"front-end" desaturases to be identified by the presence of these motifs. 



